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Streaming media and its applications 


The use of streaming media has been growing among both businesses and 
consumers. Gartner estimates that as of February 2000, about 52 million U.S. 
adult consumers tried using streaming media at some point. As of September 
1999, about 17% of U.S. businesses were using streaming on their websites for 
various applications. Gartner expects this figure to grow to 47% by 2001. Figure 
1 shows the various applications that streaming media is being used for. 


Figure I 
Streaming Media Applications 


62% 62% 


37% 37% 


16 November, 2000 Gartner Group 





Source: survey of 200 US businesses; September 1999; base: those using streaming 


The number of BtoC sites that are using streaming media has also proliferated in 
the last couple of years. At the top are news sites and internet radio stations. 
There are approximately 5,000 internet radio stations today worldwide. While 
majority are internet only radio stations, about 200 of the 10,000 tradional radio 
stations in the U.S. are also streaming their shows. Approximately 30 million 
U.S. consumers are listening to internet radio stations today. Typically the listen 
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time is ап hour and many are corporate employees that listen during a lunch 
break. 


Interactive Streaming Platform 


Streaming is a non-interactive technology that allows video/audio to be sent as 
packets that are buffered at the client for a higher performance and quality to 
overcome the latency issues of a packet based network. A number of 
applications have been built integrating streaming media in the past year for web 
based communication such as mShow, WorldStream and NetPodium (now part 
of Akamai). These applications provide a certain level of non-multimedia 
interactivity, ie. Text chat based interaction that allows the participants to interact 
with the presenter. However, they do not allow for voice interactivity. Text based 
interactivity is beneficial in environments in which PCs do not come with 
microphone as is typical of many corporations. However, it is cumbersome and 
slow. 


VocaLoca’s interactive streaming platform enables voice based interactivity that 
is tightly integrated with streaming media. For example, a streaming media show 
can now have “talk” buttons for the audience to ask questions verbally and for the 
show host to respond similarly thus bringing internet communication closer to 
natural verbal communication. 


How VocaLoca’s interactive streaming 
platform works 


Architecture 


Figure 2 shows the basic architecture for VocaLoca’s interactive streaming 
platform and its services based on this platform. Its architecture consists of a 
Studio-in-the-Sky (SITS)™ that can be accessed by a user (broadcaster or a 
listener) with just a web browser and a telephone or a microphone. The SITS™is 
a co-location at Exodus that consists of- 


Ф 750 encoding engines (to encode incoming voice over IP stream into 
streaming media) deployed currently that could be scaled relatively quickly in 
number 

Ф mixing engines (for synchronizing audience talkback or audio visual 
advertisements within broadcaster stream) 

• Talk-back engines 

Ф Arrays of web logic servers running on Network engine 10 devices to serve 
broadcaster and user interfaces 

• Load balanced with Alteon and Legato load balancers 
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The SITS™ also includes а Storage Area Network built around Network 
Appliance Devices. 


Figure 2 
VocaLoca Studio in the Sky (SITS)™ 
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How it works 


Table 1 shows VocaLoca’s positioning relative to other companies on two 
dimensions — voice over IP and streaming. The applications that voice over IP 
and streaming media are useful for are different and to a certain extent 
contingent on bandwidth and infrastructure limitations. They are also different in 
the markets they are targeting although there is some overlap. 


Companies like Net2Phone are point to point voice over IP applications that are 
mainly aimed at replacing current legacy long distance. Other voice over IP 
applications such as HearMe are multipoint in nature and allow for applications 
such as CRM, conferencing. However, they are focused on realtime talk like a 
telephone. 


Streaming media by itself is non-interactive. Any interaction that is incorporated 


today in applications such as NetPodium (now part of Akamai) is through text 
based chats. 
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Моса! оса combines the two technologies (multipoint voice over IP and 
streaming) to offer live and time-shifted, interactive rich media broadcasts. 














Table 1 
Technology Landscape 
Technology | Point to Multi-Point Interactive Streaming 
point VoIP VoIP broadcasting 
Application | Long Live talk Live and time- | Rich media 
distance interaction shifted, broadcasts 
displacement | and recorded | interactive, rich 
(voice only); | media 
broadcasts 
(more visual) 
Market Long CRM, eLearning,CR | Media, B2B 
segments Distance VoiceChat, M, Corporate broadcasts 
Communities | communication 
Business , training 
audio 
conferencing 
Companies | Net2Phone LivePerson; | NetPodium, RealNetworks 
Fire Talk; mShow, ; Microsoft; 
HearMe; VocaLoca Apple; 























VocaLoca’s architecture marries the best of streaming technology with the best 
of voice over IP technology. Table 2 shows the differences between VocaLoca’s 
interactive streaming and voice over IP. Streaming technology provides better 
audio quality than voice over IP due to the fact that it buffers to reduce latencies 
in the IP network. Voice over IP, on the other hand enables direct interaction, 
something that is not possible with streaming today. VocaLoca’s architecture 
enables switching and conversion between the two technologies to make 
interactive streaming possible. 

Table 2 


VocaLoca - Marrying the best of streaming with the best of voice over IP 








Features Voice over IP VocaLoca 
Interactive 
streaming 

Audio Talkback Relatively higher 

quality quality audio; talk, 


talkback, music 
and rich media 
Interaction | One to one; many to One to many; 
many; moderated 
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interaction; 




















Time Live Live, prerecorded, 
shifting on-demand 

Push Limited live Live, prerecorded, 
content on-demand 
Session 2 to 15,000 per MCU; 2 to infinite 
scalability | Bidirectional scalability; (subject to 


capacity on 
content delivery 
networks) 
unidirectional 
scalability since it 
takes advantage of 
streaming. Also 
bidirectional; 
However, 
bidirectional 
scalability (i.e. 
number of 
participants that 
can use “talkback” 
features and get a 
response fast) 
limited due to use 
of voice over ІР 
technology and 
nature of talkback 
function that 
queues up 
participants; 
However, this 
should not be a 
disadvantage as 
99% of participants 
in a broadcast are 
usually passive 
listeners; 





In traditional radio broadcasts, there is a time lag between when an audience 


question is answered and when the rest of the audience gets to hear it. 


VocaLoca makes use of a similar system and incorporates it within its streaming 


broadcasts. 


The broadcaster can use either a PC microphone or the telephone along with the 
broadcaster interface (VocaHost booth) served up by the WebLogic servers to 
conduct the show. The broadcaster is always in the voice over IP mode. The 
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audio is then encoded on the fly by the encoding engines а the Exodus co- 
location facility and streamed to a content delivery network from where it reaches 
the audience in the streaming format. The VocaLoca interactive player allows 
the listener to listen and view the broadcast/presentation. 


If the listener wants to talk back with a microphone, a HearMe talkback 
component is automatically downloaded and installed on the client. This is a 
one-time install and is about 100k in size. When the listener presses the talk- 
back button, he/she enters the voice over IP mode which is synchronized at the 
SITS™ and delivered as a single stream to the encoding engines for conversion 
to the streaming format and then streamed across the content delivery networks. 
When the listener and/or the broadcaster host uses the telephone as the voice 
input device, it is converted to voice over IP at the gateway (to be installed at the 
Exodus co-location) and then sent to the mixing/encoding engines. 


Currently VocaLoca only supports Real media streaming but will extend support 
for Microsoft windows media, QuickTime and MP3 in the near future. VocaLoca 
uses Activate as its content delivery network today but can use all of the content 
delivery networks available today to increase its reach and scalability. Gartner 
estimates that the capacity for all content delivery networks put together is for 
about a million simultaneous 56kbps streams as of 2000. 


Features 


VocaLoca technology has the following distinctive features: 

Ф Low technology barrier for all participants — hosts and listeners 

Ф Itis easily brandable and linkable. Any site interested in offering such 
services only needs a user interface. The rest is taken care of at the SITS™ 
thus requiring minimal effort in deployment. 


+ It supports RealMedia today, but will support windows and QuickTime in the 
future. 

• Makes streaming media applications voice interactive 

• Сап create both live and time-shifted content 

Ф Will accept audio input from either a PC microphone or a standard telephone 


+ It provides the ability for the broadcaster to push URLs with the broadcast. 
These URLs may refer to static web pages, web forms, or web rich media 
including video clips. 

• The SITS™also mixes in audio/visual synchronized advertisements with 
regular schedules. 

+ Shows created may be designated as private requiring a password or hidden 
for greater security. 

• Shows/presentations may be displayed as simply as by placing hyperlinks on 
web pages, serving links dynamically when the show is on or by sending 
emails (or messaging) to invitees with embedded links. 
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Target market segments 


VocaLoca is targeting three primary segments of the market. 
ELearning/Training: 

Streaming is already being used for training by 7% of U.S. businesses as of 
September 1999 (42% of those using streaming — i.e. 17% of U.S. businesses 
are using it for training). Gartner expects web based communication market to 
grow at a compounded annual growth rate of 59%. Also Interactive 
broadcasting/web conferencing was used by 18% of U.S. businesses as of 1999 
and is expected to grow to 40% by 2001-year end. Most applications today either 
offer streaming only or voice based interaction only or streaming with chat based 
interaction. Although some offer streaming along with interaction on telephone, 
VocaLoca is the only company offering tight integration of streaming with voice 
over IP. 


Gartner predicts that by 2003, half of all IT professional training will be delivered 
via e-learning (.7 probability). Gartner estimates the percentage of IT training 
budget devoted to e-learning to increase from 13% in 2000 to 18% in 2001. It 
could be as high as 30% to 40% in financial, distribution, K-12 through college, 
food products, and hardware/network equipment industries (Source: Gartner M 
11-5734; September 2000). 


Customer relationship Management: 


Gartner Dataquest expects the CRM services market to top $7 billion in revenue 
globally in 2000, and grow to more than $20 billion by 2003 at a compound 
annual growth rate of over 48 percent. The growing importance of the Internet as 
a channel of communication has been a critical driver for increasing investment 
in Internet-based CRM strategies. Although field sales (82 percent) still remains 
the primary channel of customer communication, the Web (74 percent) and e- 
mail (76 percent) have grown rapidly into prominence (Source: Gartner 
Dataquest CARE-WW-MT-0001; November 2000). 


VocaLoca extends the use of streaming to customer relationship management by 
making it interactive. Although videoconferencing is being touted as a solution 
for this, streaming has made more inroads. Voice over IP is another technology 
that is offered as a solution for this space. However, it is missing the video 
element which is likely to become more important down the road especially 
among BtoC businesses. 


Ecommerce 
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Despite the current rumblings in the dot com space with Ше demise and layoffs, 
ecommerce is expected to be an integral and growing segment of the economy. 
According to Gartner, the North American internet retailing market grew 157% in 
1999 and is forecast to reach $142 billion in 2004. The volume of non-financial 
goods and services sold through business to business e-commerce is expected 
to reach $7.29 trillion worldwide in 2004, about 7% of the world economy forecast 
for that year (Source: Gartner Dataquest • EBTB-EU-DP-0002. February 2000). 


There are numerous ecommerce applications that can benefit from VocaLoca’s 
technology. Some examples include : 
e classifieds, 
e sites such as ebay, 
e for real estate agents to show their properties and interact with potential 
customers at the same time, 
match making sites, as well as, 
streaming media advertisements that provide a link to click on to talk to a live 
person for more information. 


VocaLoca’s Business model 


VocaLoca’s major investments are in its co-location facilities with Exodus which 
consists of its studio in the sky. Once the streams are encoded, they are pushed 
onto the content delivery network for distribution. Currently, VocaLoca uses 
Activate as its content delivery network. However, it can scale easily by using 
other content delivery networks that provide streaming today. Gartner estimates 
that the current capacity for number of simultaneous streams (56K) between 
Akamai, Digital Island and Беат is approximately 800k. With other players 
coming into the market and also with the current providers adding to their 
capacity, this is only likely to increase in the future. In addition, the average 
selling price for streaming video is about half a penny per MB today. This is also 
declining and is expected to go down to a tenth of a penny in the next couple of 
years. 


VocaLoca intends to charge for its services. Deployment takes very little effort 
for the customers due to its link and launch features. All that is required is 
creating the appropriate branded interface with all encoding and mixing and 
streaming done at the SITS™ and the content delivery network chosen. 
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